Overview
Brought to you by YData
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 500 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 135.8 KiB |
| Average record size in memory | 278.0 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 7 |
| Categorical | 6 |
| DateTime | 1 |
| Boolean | 4 |
Annual_Income is highly overall correlated with Cluster | High correlation |
Cluster is highly overall correlated with Annual_Income | High correlation |
Gender is highly overall correlated with Gender_Encoded | High correlation |
Gender_Encoded is highly overall correlated with Gender | High correlation |
Membership_Encoded is highly overall correlated with Membership_Status | High correlation |
Membership_Status is highly overall correlated with Membership_Encoded | High correlation |
Purchase_Amount is highly overall correlated with Z_Purchase_Amount | High correlation |
Z_Purchase_Amount is highly overall correlated with Purchase_Amount | High correlation |
Customer_ID has unique values | Unique |
Annual_Income has unique values | Unique |
Reproduction
| Analysis started | 2025-04-14 12:57:53.745732 |
|---|---|
| Analysis finished | 2025-04-14 12:58:00.757351 |
| Duration | 7.01 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
Customer_ID
Text
Unique 
| Distinct | 500 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 500 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | CUST0000 |
|---|---|
| 2nd row | CUST0001 |
| 3rd row | CUST0002 |
| 4th row | CUST0003 |
| 5th row | CUST0004 |
| Value | Count | Frequency (%) |
| cust0004 | 1 | 0.2% |
| cust0499 | 1 | 0.2% |
| cust0000 | 1 | 0.2% |
| cust0001 | 1 | 0.2% |
| cust0484 | 1 | 0.2% |
| cust0485 | 1 | 0.2% |
| cust0486 | 1 | 0.2% |
| cust0487 | 1 | 0.2% |
| cust0488 | 1 | 0.2% |
| cust0489 | 1 | 0.2% |
| Other values (490) | 490 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 700 | |
| C | 500 | |
| U | 500 | |
| S | 500 | |
| T | 500 | |
| 4 | 200 | 5.0% |
| 1 | 200 | 5.0% |
| 2 | 200 | 5.0% |
| 3 | 200 | 5.0% |
| 9 | 100 | 2.5% |
| Other values (4) | 400 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 700 | |
| C | 500 | |
| U | 500 | |
| S | 500 | |
| T | 500 | |
| 4 | 200 | 5.0% |
| 1 | 200 | 5.0% |
| 2 | 200 | 5.0% |
| 3 | 200 | 5.0% |
| 9 | 100 | 2.5% |
| Other values (4) | 400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 700 | |
| C | 500 | |
| U | 500 | |
| S | 500 | |
| T | 500 | |
| 4 | 200 | 5.0% |
| 1 | 200 | 5.0% |
| 2 | 200 | 5.0% |
| 3 | 200 | 5.0% |
| 9 | 100 | 2.5% |
| Other values (4) | 400 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 700 | |
| C | 500 | |
| U | 500 | |
| S | 500 | |
| T | 500 | |
| 4 | 200 | 5.0% |
| 1 | 200 | 5.0% |
| 2 | 200 | 5.0% |
| 3 | 200 | 5.0% |
| 9 | 100 | 2.5% |
| Other values (4) | 400 |
Age
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | 10.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.22 |
| Minimum | 18 |
|---|---|
| Maximum | 69 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 32 |
| median | 45 |
| Q3 | 57 |
| 95-th percentile | 67 |
| Maximum | 69 |
| Range | 51 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 15.036082 |
|---|---|
| Coefficient of variation (CV) | 0.340029 |
| Kurtosis | -1.1069905 |
| Mean | 44.22 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.11270863 |
| Sum | 22110 |
| Variance | 226.08377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 18 | 3.6% |
| 52 | 16 | 3.2% |
| 49 | 15 | 3.0% |
| 41 | 15 | 3.0% |
| 61 | 14 | 2.8% |
| 19 | 14 | 2.8% |
| 69 | 14 | 2.8% |
| 56 | 13 | 2.6% |
| 45 | 13 | 2.6% |
| 42 | 12 | 2.4% |
| Other values (42) | 356 |
| Value | Count | Frequency (%) |
| 18 | 12 | |
| 19 | 14 | |
| 20 | 11 | |
| 21 | 8 | |
| 22 | 6 | |
| 23 | 9 | |
| 24 | 9 | |
| 25 | 12 | |
| 26 | 10 | |
| 27 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 69 | 14 | |
| 68 | 10 | |
| 67 | 6 | |
| 66 | 12 | |
| 65 | 12 | |
| 64 | 9 | |
| 63 | 6 | |
| 62 | 9 | |
| 61 | 14 | |
| 60 | 4 | 0.8% |
Gender
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.4 KiB |
| Male | |
|---|---|
| Female | |
| Other |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.942 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Other |
| 3rd row | Other |
| 4th row | Male |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Male | 188 | |
| Female | 159 | |
| Other | 153 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 188 | |
| female | 159 | |
| other | 153 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 659 | |
| a | 347 | |
| l | 347 | |
| M | 188 | 7.6% |
| F | 159 | 6.4% |
| m | 159 | 6.4% |
| O | 153 | 6.2% |
| t | 153 | 6.2% |
| h | 153 | 6.2% |
| r | 153 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2471 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 659 | |
| a | 347 | |
| l | 347 | |
| M | 188 | 7.6% |
| F | 159 | 6.4% |
| m | 159 | 6.4% |
| O | 153 | 6.2% |
| t | 153 | 6.2% |
| h | 153 | 6.2% |
| r | 153 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2471 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 659 | |
| a | 347 | |
| l | 347 | |
| M | 188 | 7.6% |
| F | 159 | 6.4% |
| m | 159 | 6.4% |
| O | 153 | 6.2% |
| t | 153 | 6.2% |
| h | 153 | 6.2% |
| r | 153 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2471 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 659 | |
| a | 347 | |
| l | 347 | |
| M | 188 | 7.6% |
| F | 159 | 6.4% |
| m | 159 | 6.4% |
| O | 153 | 6.2% |
| t | 153 | 6.2% |
| h | 153 | 6.2% |
| r | 153 | 6.2% |
Annual_Income
Real number (ℝ)
High correlation  Unique 
| Distinct | 500 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 85364.862 |
| Minimum | 20077 |
|---|---|
| Maximum | 149948 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 20077 |
|---|---|
| 5-th percentile | 24419.95 |
| Q1 | 50418.5 |
| median | 87475 |
| Q3 | 120223.5 |
| 95-th percentile | 145887.6 |
| Maximum | 149948 |
| Range | 129871 |
| Interquartile range (IQR) | 69805 |
Descriptive statistics
| Standard deviation | 39127.187 |
|---|---|
| Coefficient of variation (CV) | 0.45835237 |
| Kurtosis | -1.2290321 |
| Mean | 85364.862 |
| Median Absolute Deviation (MAD) | 33998 |
| Skewness | -0.016726739 |
| Sum | 42682431 |
| Variance | 1.5309367 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80500 | 1 | 0.2% |
| 102152 | 1 | 0.2% |
| 145036 | 1 | 0.2% |
| 144049 | 1 | 0.2% |
| 32175 | 1 | 0.2% |
| 104458 | 1 | 0.2% |
| 133453 | 1 | 0.2% |
| 80535 | 1 | 0.2% |
| 95039 | 1 | 0.2% |
| 51821 | 1 | 0.2% |
| Other values (490) | 490 |
| Value | Count | Frequency (%) |
| 20077 | 1 | |
| 20117 | 1 | |
| 20126 | 1 | |
| 20235 | 1 | |
| 20814 | 1 | |
| 20846 | 1 | |
| 20922 | 1 | |
| 21342 | 1 | |
| 21605 | 1 | |
| 21645 | 1 |
| Value | Count | Frequency (%) |
| 149948 | 1 | |
| 149922 | 1 | |
| 149597 | 1 | |
| 149038 | 1 | |
| 149028 | 1 | |
| 148876 | 1 | |
| 148644 | 1 | |
| 148177 | 1 | |
| 147978 | 1 | |
| 147796 | 1 |
Spending_Score
Real number (ℝ)
| Distinct | 101 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.472 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 2 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 25 |
| median | 49.5 |
| Q3 | 76.25 |
| 95-th percentile | 97 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 51.25 |
Descriptive statistics
| Standard deviation | 29.724608 |
|---|---|
| Coefficient of variation (CV) | 0.58893264 |
| Kurtosis | -1.2325387 |
| Mean | 50.472 |
| Median Absolute Deviation (MAD) | 25.5 |
| Skewness | 0.058069184 |
| Sum | 25236 |
| Variance | 883.55232 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 97 | 17 | 3.4% |
| 98 | 11 | 2.2% |
| 58 | 11 | 2.2% |
| 24 | 10 | 2.0% |
| 28 | 10 | 2.0% |
| 39 | 10 | 2.0% |
| 16 | 10 | 2.0% |
| 61 | 9 | 1.8% |
| 86 | 9 | 1.8% |
| 38 | 9 | 1.8% |
| Other values (91) | 394 |
| Value | Count | Frequency (%) |
| 0 | 2 | 0.4% |
| 1 | 4 | |
| 2 | 5 | |
| 3 | 4 | |
| 4 | 7 | |
| 5 | 4 | |
| 6 | 4 | |
| 7 | 1 | 0.2% |
| 8 | 6 | |
| 9 | 5 |
| Value | Count | Frequency (%) |
| 100 | 5 | 1.0% |
| 99 | 4 | 0.8% |
| 98 | 11 | |
| 97 | 17 | |
| 96 | 1 | 0.2% |
| 95 | 6 | 1.2% |
| 94 | 3 | 0.6% |
| 93 | 5 | 1.0% |
| 92 | 7 | |
| 91 | 3 | 0.6% |
Purchase_Amount
Real number (ℝ)
High correlation 
| Distinct | 498 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 503.09406 |
| Minimum | 14.89 |
|---|---|
| Maximum | 999.42 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 14.89 |
|---|---|
| 5-th percentile | 66.2665 |
| Q1 | 243.3425 |
| median | 505.72 |
| Q3 | 733.5725 |
| 95-th percentile | 956.074 |
| Maximum | 999.42 |
| Range | 984.53 |
| Interquartile range (IQR) | 490.23 |
Descriptive statistics
| Standard deviation | 286.51389 |
|---|---|
| Coefficient of variation (CV) | 0.56950363 |
| Kurtosis | -1.1917615 |
| Mean | 503.09406 |
| Median Absolute Deviation (MAD) | 251.545 |
| Skewness | -0.0053721663 |
| Sum | 251547.03 |
| Variance | 82090.211 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 245.17 | 2 | 0.4% |
| 869.59 | 2 | 0.4% |
| 841.46 | 1 | 0.2% |
| 416.92 | 1 | 0.2% |
| 614.87 | 1 | 0.2% |
| 219.92 | 1 | 0.2% |
| 235.8 | 1 | 0.2% |
| 885.36 | 1 | 0.2% |
| 821 | 1 | 0.2% |
| 605.6 | 1 | 0.2% |
| Other values (488) | 488 |
| Value | Count | Frequency (%) |
| 14.89 | 1 | |
| 15.7 | 1 | |
| 17.49 | 1 | |
| 19.67 | 1 | |
| 21.5 | 1 | |
| 28.05 | 1 | |
| 33.4 | 1 | |
| 35.39 | 1 | |
| 35.55 | 1 | |
| 39.67 | 1 |
| Value | Count | Frequency (%) |
| 999.42 | 1 | |
| 996.37 | 1 | |
| 994.2 | 1 | |
| 992.56 | 1 | |
| 992.24 | 1 | |
| 991.34 | 1 | |
| 987.91 | 1 | |
| 987.77 | 1 | |
| 986.8 | 1 | |
| 986.76 | 1 |
Transaction_Frequency
Real number (ℝ)
| Distinct | 49 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.338 |
| Minimum | 1 |
|---|---|
| Maximum | 49 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 14 |
| median | 24 |
| Q3 | 37 |
| 95-th percentile | 47 |
| Maximum | 49 |
| Range | 48 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 13.717438 |
|---|---|
| Coefficient of variation (CV) | 0.54137807 |
| Kurtosis | -1.1338052 |
| Mean | 25.338 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.0091648999 |
| Sum | 12669 |
| Variance | 188.16809 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21 | 24 | 4.8% |
| 4 | 14 | 2.8% |
| 24 | 14 | 2.8% |
| 37 | 13 | 2.6% |
| 17 | 13 | 2.6% |
| 22 | 13 | 2.6% |
| 49 | 13 | 2.6% |
| 39 | 12 | 2.4% |
| 36 | 12 | 2.4% |
| 32 | 12 | 2.4% |
| Other values (39) | 360 |
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 2 | 6 | |
| 3 | 6 | |
| 4 | 14 | |
| 5 | 11 | |
| 6 | 10 | |
| 7 | 10 | |
| 8 | 9 | |
| 9 | 9 | |
| 10 | 11 |
| Value | Count | Frequency (%) |
| 49 | 13 | |
| 48 | 7 | |
| 47 | 8 | |
| 46 | 11 | |
| 45 | 11 | |
| 44 | 10 | |
| 43 | 9 | |
| 42 | 10 | |
| 41 | 10 | |
| 40 | 11 |
Membership_Status
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 30.8 KiB |
| Platinum | |
|---|---|
| Gold | |
| Basic | |
| Silver |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 5.84 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Silver |
|---|---|
| 2nd row | Basic |
| 3rd row | Silver |
| 4th row | Basic |
| 5th row | Basic |
Common Values
| Value | Count | Frequency (%) |
| Platinum | 142 | |
| Gold | 122 | |
| Basic | 120 | |
| Silver | 116 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| platinum | 142 | |
| gold | 122 | |
| basic | 120 | |
| silver | 116 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 380 | |
| i | 378 | 12.9% |
| a | 262 | 9.0% |
| P | 142 | 4.9% |
| t | 142 | 4.9% |
| n | 142 | 4.9% |
| u | 142 | 4.9% |
| m | 142 | 4.9% |
| G | 122 | 4.2% |
| o | 122 | 4.2% |
| Other values (8) | 946 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2920 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 380 | |
| i | 378 | 12.9% |
| a | 262 | 9.0% |
| P | 142 | 4.9% |
| t | 142 | 4.9% |
| n | 142 | 4.9% |
| u | 142 | 4.9% |
| m | 142 | 4.9% |
| G | 122 | 4.2% |
| o | 122 | 4.2% |
| Other values (8) | 946 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2920 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 380 | |
| i | 378 | 12.9% |
| a | 262 | 9.0% |
| P | 142 | 4.9% |
| t | 142 | 4.9% |
| n | 142 | 4.9% |
| u | 142 | 4.9% |
| m | 142 | 4.9% |
| G | 122 | 4.2% |
| o | 122 | 4.2% |
| Other values (8) | 946 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2920 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 380 | |
| i | 378 | 12.9% |
| a | 262 | 9.0% |
| P | 142 | 4.9% |
| t | 142 | 4.9% |
| n | 142 | 4.9% |
| u | 142 | 4.9% |
| m | 142 | 4.9% |
| G | 122 | 4.2% |
| o | 122 | 4.2% |
| Other values (8) | 946 |
Purchase_Date
Date
| Distinct | 360 |
|---|---|
| Distinct (%) | 72.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.0 KiB |
| Minimum | 2023-04-14 00:00:00 |
|---|---|
| Maximum | 2025-04-13 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Z_Purchase_Amount
Real number (ℝ)
High correlation 
| Distinct | 498 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -4.9737992 × 10-17 |
| Minimum | -1.705652 |
|---|---|
| Maximum | 1.7340276 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 248 |
| Negative (%) | 49.6% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | -1.705652 |
|---|---|
| 5-th percentile | -1.5261565 |
| Q1 | -0.9075012 |
| median | 0.0091743191 |
| Q3 | 0.80522889 |
| 95-th percentile | 1.5825885 |
| Maximum | 1.7340276 |
| Range | 3.4396797 |
| Interquartile range (IQR) | 1.7127301 |
Descriptive statistics
| Standard deviation | 1.0010015 |
|---|---|
| Coefficient of variation (CV) | -2.0125491 × 1016 |
| Kurtosis | -1.1917615 |
| Mean | -4.9737992 × 10-17 |
| Median Absolute Deviation (MAD) | 0.87882971 |
| Skewness | -0.0053721663 |
| Sum | -1.7319479 × 10-14 |
| Variance | 1.002004 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -0.9011164125 | 2 | 0.4% |
| 1.280436989 | 2 | 0.4% |
| 1.182158431 | 1 | 0.2% |
| -0.3010686936 | 1 | 0.2% |
| 0.3905146889 | 1 | 0.2% |
| -0.989333035 | 1 | 0.2% |
| -0.9338526403 | 1 | 0.2% |
| 1.335533073 | 1 | 0.2% |
| 1.110676763 | 1 | 0.2% |
| 0.3581278339 | 1 | 0.2% |
| Other values (488) | 488 |
| Value | Count | Frequency (%) |
| -1.705652009 | 1 | |
| -1.70282209 | 1 | |
| -1.696568317 | 1 | |
| -1.688951991 | 1 | |
| -1.68255847 | 1 | |
| -1.659674554 | 1 | |
| -1.640983111 | 1 | |
| -1.634030593 | 1 | |
| -1.633471596 | 1 | |
| -1.619077439 | 1 |
| Value | Count | Frequency (%) |
| 1.734027646 | 1 | |
| 1.723371776 | 1 | |
| 1.715790387 | 1 | |
| 1.710060674 | 1 | |
| 1.708942681 | 1 | |
| 1.705798326 | 1 | |
| 1.693814841 | 1 | |
| 1.693325719 | 1 | |
| 1.689936803 | 1 | |
| 1.689797054 | 1 |
Gender_Encoded
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.4 KiB |
| 1 | |
|---|---|
| 0 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 188 | |
| 0 | 159 | |
| 2 | 153 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 188 | |
| 0 | 159 | |
| 2 | 153 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 188 | |
| 0 | 159 | |
| 2 | 153 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 188 | |
| 0 | 159 | |
| 2 | 153 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 188 | |
| 0 | 159 | |
| 2 | 153 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 188 | |
| 0 | 159 | |
| 2 | 153 |
Membership_Encoded
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.4 KiB |
| 2 | |
|---|---|
| 1 | |
| 0 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 0 |
| 3rd row | 3 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 142 | |
| 1 | 122 | |
| 0 | 120 | |
| 3 | 116 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 142 | |
| 1 | 122 | |
| 0 | 120 | |
| 3 | 116 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 142 | |
| 1 | 122 | |
| 0 | 120 | |
| 3 | 116 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 142 | |
| 1 | 122 | |
| 0 | 120 | |
| 3 | 116 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 142 | |
| 1 | 122 | |
| 0 | 120 | |
| 3 | 116 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 142 | |
| 1 | 122 | |
| 0 | 120 | |
| 3 | 116 |
Category_Purchased_Clothing
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 632.0 B |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 412 | |
| True | 88 | 17.6% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 632.0 B |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 398 | |
| True | 102 | 20.4% |
Category_Purchased_Groceries
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 632.0 B |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 396 | |
| True | 104 | 20.8% |
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 632.0 B |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 390 | |
| True | 110 | 22.0% |
Cluster
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.4 KiB |
| 2 | |
|---|---|
| 1 | |
| 0 | |
| 3 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 140 | |
| 1 | 130 | |
| 0 | 124 | |
| 3 | 106 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 140 | |
| 1 | 130 | |
| 0 | 124 | |
| 3 | 106 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 140 | |
| 1 | 130 | |
| 0 | 124 | |
| 3 | 106 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 140 | |
| 1 | 130 | |
| 0 | 124 | |
| 3 | 106 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 140 | |
| 1 | 130 | |
| 0 | 124 | |
| 3 | 106 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 140 | |
| 1 | 130 | |
| 0 | 124 | |
| 3 | 106 |
Purchase_Month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.458 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.4825986 |
|---|---|
| Coefficient of variation (CV) | 0.53926891 |
| Kurtosis | -1.2063735 |
| Mean | 6.458 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.032647111 |
| Sum | 3229 |
| Variance | 12.128493 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 49 | |
| 8 | 47 | |
| 9 | 45 | |
| 4 | 44 | |
| 2 | 43 | |
| 1 | 43 | |
| 3 | 42 | |
| 5 | 40 | |
| 7 | 40 | |
| 6 | 39 | |
| Other values (2) | 68 |
| Value | Count | Frequency (%) |
| 1 | 43 | |
| 2 | 43 | |
| 3 | 42 | |
| 4 | 44 | |
| 5 | 40 | |
| 6 | 39 | |
| 7 | 40 | |
| 8 | 47 | |
| 9 | 45 | |
| 10 | 33 |
| Value | Count | Frequency (%) |
| 12 | 49 | |
| 11 | 35 | |
| 10 | 33 | |
| 9 | 45 | |
| 8 | 47 | |
| 7 | 40 | |
| 6 | 39 | |
| 5 | 40 | |
| 4 | 44 | |
| 3 | 42 |
Purchase_Year
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 29.9 KiB |
| 2024 | |
|---|---|
| 2023 | |
| 2025 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024 |
|---|---|
| 2nd row | 2024 |
| 3rd row | 2025 |
| 4th row | 2023 |
| 5th row | 2025 |
Common Values
| Value | Count | Frequency (%) |
| 2024 | 255 | |
| 2023 | 175 | |
| 2025 | 70 | 14.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2024 | 255 | |
| 2023 | 175 | |
| 2025 | 70 | 14.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1000 | |
| 0 | 500 | |
| 4 | 255 | 12.8% |
| 3 | 175 | 8.8% |
| 5 | 70 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1000 | |
| 0 | 500 | |
| 4 | 255 | 12.8% |
| 3 | 175 | 8.8% |
| 5 | 70 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1000 | |
| 0 | 500 | |
| 4 | 255 | 12.8% |
| 3 | 175 | 8.8% |
| 5 | 70 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 1000 | |
| 0 | 500 | |
| 4 | 255 | 12.8% |
| 3 | 175 | 8.8% |
| 5 | 70 | 3.5% |
Interactions
Correlations
| Age | Annual_Income | Category_Purchased_Clothing | Category_Purchased_Electronics | Category_Purchased_Groceries | Category_Purchased_Home Decor | Cluster | Gender | Gender_Encoded | Membership_Encoded | Membership_Status | Purchase_Amount | Purchase_Month | Purchase_Year | Spending_Score | Transaction_Frequency | Z_Purchase_Amount | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | -0.060 | 0.064 | 0.110 | 0.000 | 0.000 | 0.000 | 0.065 | 0.065 | 0.027 | 0.027 | 0.008 | -0.023 | 0.000 | -0.014 | -0.013 | 0.008 |
| Annual_Income | -0.060 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.910 | 0.000 | 0.000 | 0.069 | 0.069 | -0.058 | 0.043 | 0.015 | -0.048 | 0.029 | -0.058 |
| Category_Purchased_Clothing | 0.064 | 0.000 | 1.000 | 0.223 | 0.226 | 0.235 | 0.000 | 0.000 | 0.000 | 0.036 | 0.036 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Category_Purchased_Electronics | 0.110 | 0.000 | 0.223 | 1.000 | 0.250 | 0.259 | 0.000 | 0.000 | 0.000 | 0.017 | 0.017 | 0.000 | 0.000 | 0.072 | 0.147 | 0.016 | 0.000 |
| Category_Purchased_Groceries | 0.000 | 0.000 | 0.226 | 0.250 | 1.000 | 0.263 | 0.000 | 0.047 | 0.047 | 0.048 | 0.048 | 0.083 | 0.000 | 0.054 | 0.000 | 0.099 | 0.083 |
| Category_Purchased_Home Decor | 0.000 | 0.000 | 0.235 | 0.259 | 0.263 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.038 | 0.087 | 0.000 | 0.000 | 0.071 | 0.038 |
| Cluster | 0.000 | 0.910 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.056 | 0.056 | 0.073 | 0.064 | 0.000 | 0.000 | 0.059 | 0.073 |
| Gender | 0.065 | 0.000 | 0.000 | 0.000 | 0.047 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.055 | 0.105 | 0.090 | 0.104 | 0.000 | 0.055 |
| Gender_Encoded | 0.065 | 0.000 | 0.000 | 0.000 | 0.047 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.055 | 0.105 | 0.090 | 0.104 | 0.000 | 0.055 |
| Membership_Encoded | 0.027 | 0.069 | 0.036 | 0.017 | 0.048 | 0.000 | 0.056 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.033 | 0.017 | 0.000 | 0.000 | 0.000 |
| Membership_Status | 0.027 | 0.069 | 0.036 | 0.017 | 0.048 | 0.000 | 0.056 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.033 | 0.017 | 0.000 | 0.000 | 0.000 |
| Purchase_Amount | 0.008 | -0.058 | 0.000 | 0.000 | 0.083 | 0.038 | 0.073 | 0.055 | 0.055 | 0.000 | 0.000 | 1.000 | -0.035 | 0.000 | 0.055 | 0.060 | 1.000 |
| Purchase_Month | -0.023 | 0.043 | 0.000 | 0.000 | 0.000 | 0.087 | 0.064 | 0.105 | 0.105 | 0.033 | 0.033 | -0.035 | 1.000 | 0.475 | -0.014 | 0.007 | -0.035 |
| Purchase_Year | 0.000 | 0.015 | 0.000 | 0.072 | 0.054 | 0.000 | 0.000 | 0.090 | 0.090 | 0.017 | 0.017 | 0.000 | 0.475 | 1.000 | 0.000 | 0.128 | 0.000 |
| Spending_Score | -0.014 | -0.048 | 0.000 | 0.147 | 0.000 | 0.000 | 0.000 | 0.104 | 0.104 | 0.000 | 0.000 | 0.055 | -0.014 | 0.000 | 1.000 | -0.030 | 0.055 |
| Transaction_Frequency | -0.013 | 0.029 | 0.000 | 0.016 | 0.099 | 0.071 | 0.059 | 0.000 | 0.000 | 0.000 | 0.000 | 0.060 | 0.007 | 0.128 | -0.030 | 1.000 | 0.060 |
| Z_Purchase_Amount | 0.008 | -0.058 | 0.000 | 0.000 | 0.083 | 0.038 | 0.073 | 0.055 | 0.055 | 0.000 | 0.000 | 1.000 | -0.035 | 0.000 | 0.055 | 0.060 | 1.000 |
Missing values
Sample
| Customer_ID | Age | Gender | Annual_Income | Spending_Score | Purchase_Amount | Transaction_Frequency | Membership_Status | Purchase_Date | Z_Purchase_Amount | Gender_Encoded | Membership_Encoded | Category_Purchased_Clothing | Category_Purchased_Electronics | Category_Purchased_Groceries | Category_Purchased_Home Decor | Cluster | Purchase_Month | Purchase_Year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CUST0000 | 56 | Male | 102152 | 18 | 228.56 | 11 | Silver | 2024-05-28 | -0.959147 | 1 | 3 | False | False | False | True | 1 | 5 | 2024 |
| 1 | CUST0001 | 69 | Other | 145036 | 18 | 91.56 | 31 | Basic | 2024-02-03 | -1.437788 | 2 | 0 | False | True | False | False | 2 | 2 | 2024 |
| 2 | CUST0002 | 46 | Other | 144049 | 35 | 683.69 | 15 | Silver | 2025-01-31 | 0.630953 | 2 | 3 | True | False | False | False | 2 | 1 | 2025 |
| 3 | CUST0003 | 32 | Male | 46734 | 28 | 657.97 | 9 | Basic | 2023-12-13 | 0.541094 | 1 | 0 | False | False | True | False | 0 | 12 | 2023 |
| 4 | CUST0004 | 60 | Female | 26371 | 59 | 280.53 | 23 | Basic | 2025-03-11 | -0.777578 | 0 | 0 | False | True | False | False | 0 | 3 | 2025 |
| 5 | CUST0005 | 25 | Male | 138894 | 81 | 951.35 | 19 | Silver | 2023-11-17 | 1.566084 | 1 | 3 | False | True | False | False | 2 | 11 | 2023 |
| 6 | CUST0006 | 38 | Other | 46069 | 1 | 159.55 | 9 | Silver | 2023-04-14 | -1.200249 | 2 | 3 | False | False | False | True | 0 | 4 | 2023 |
| 7 | CUST0007 | 56 | Other | 99905 | 0 | 438.01 | 35 | Platinum | 2023-07-10 | -0.227386 | 2 | 2 | False | True | False | False | 1 | 7 | 2023 |
| 8 | CUST0008 | 36 | Female | 32910 | 46 | 944.18 | 21 | Basic | 2023-12-24 | 1.541034 | 0 | 0 | False | True | False | False | 0 | 12 | 2023 |
| 9 | CUST0009 | 40 | Other | 93479 | 68 | 425.53 | 9 | Silver | 2024-03-04 | -0.270988 | 2 | 3 | False | True | False | False | 1 | 3 | 2024 |
| Customer_ID | Age | Gender | Annual_Income | Spending_Score | Purchase_Amount | Transaction_Frequency | Membership_Status | Purchase_Date | Z_Purchase_Amount | Gender_Encoded | Membership_Encoded | Category_Purchased_Clothing | Category_Purchased_Electronics | Category_Purchased_Groceries | Category_Purchased_Home Decor | Cluster | Purchase_Month | Purchase_Year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 490 | CUST0490 | 18 | Other | 133374 | 28 | 114.61 | 43 | Basic | 2025-02-01 | -1.357257 | 2 | 0 | False | False | False | True | 2 | 2 | 2025 |
| 491 | CUST0491 | 18 | Female | 86358 | 3 | 644.42 | 41 | Silver | 2023-04-22 | 0.493754 | 0 | 3 | True | False | False | False | 1 | 4 | 2023 |
| 492 | CUST0492 | 64 | Female | 37327 | 80 | 223.88 | 39 | Platinum | 2024-06-14 | -0.975498 | 0 | 2 | False | False | False | True | 0 | 6 | 2024 |
| 493 | CUST0493 | 51 | Female | 67057 | 98 | 623.39 | 22 | Gold | 2024-08-12 | 0.420281 | 0 | 1 | False | False | False | True | 3 | 8 | 2024 |
| 494 | CUST0494 | 49 | Male | 26150 | 12 | 653.70 | 49 | Platinum | 2024-07-09 | 0.526176 | 1 | 2 | True | False | False | False | 0 | 7 | 2024 |
| 495 | CUST0495 | 65 | Male | 24425 | 38 | 160.50 | 17 | Platinum | 2023-10-09 | -1.196930 | 1 | 2 | False | True | False | False | 0 | 10 | 2023 |
| 496 | CUST0496 | 42 | Male | 143654 | 14 | 70.74 | 34 | Basic | 2024-04-04 | -1.510527 | 1 | 0 | False | False | False | False | 2 | 4 | 2024 |
| 497 | CUST0497 | 57 | Male | 36032 | 28 | 782.95 | 6 | Silver | 2023-08-25 | 0.977740 | 1 | 3 | True | False | False | False | 0 | 8 | 2023 |
| 498 | CUST0498 | 62 | Female | 22198 | 28 | 465.20 | 46 | Gold | 2024-10-21 | -0.132392 | 0 | 1 | False | False | True | False | 0 | 10 | 2024 |
| 499 | CUST0499 | 18 | Other | 80500 | 74 | 67.58 | 6 | Basic | 2023-12-11 | -1.521568 | 2 | 0 | False | True | False | False | 3 | 12 | 2023 |